Conversation
| seen = set() | ||
| deduped = [] | ||
| for pkg in extra_packages: | ||
| if pkg not in seen: | ||
| seen.add(pkg) | ||
| deduped.append(pkg) |
There was a problem hiding this comment.
Does the order matter? If not, this could be simplified to the following:
extra_packages = list(set(extra_packages))
| "NvTensorRTRTXExecutionProvider", | ||
| ] | ||
|
|
||
| SUPPORTED_PRECISIONS = [ |
| # Constants | ||
| # --------------------------------------------------------------------------- | ||
|
|
||
| SUPPORTED_PROVIDERS = [ |
| CMD_OPTIMIZE = "optimize" | ||
| CMD_QUANTIZE = "quantize" | ||
| CMD_FINETUNE = "finetune" | ||
| CMD_CAPTURE_ONNX_GRAPH = "capture_onnx_graph" | ||
| CMD_BENCHMARK = "benchmark" | ||
| CMD_DIFFUSION_LORA = "diffusion_lora" | ||
| CMD_EXPLORE_PASSES = "explore_passes" | ||
| CMD_VALIDATE_CONFIG = "validate_config" | ||
| CMD_RUN_CONFIG = "run_config" |
There was a problem hiding this comment.
Combine into a named StrEnum?
| while True: | ||
| try: | ||
| line = await proc.stderr.readline() | ||
| except ValueError: | ||
| # Line exceeded even the 10MB limit — skip it | ||
| continue | ||
| if not line: | ||
| break | ||
| decoded = line.decode("utf-8", errors="replace").rstrip() | ||
| if decoded: | ||
| # Truncate extremely long lines for display (e.g. base64 blobs) | ||
| if len(decoded) > 500: | ||
| decoded = decoded[:500] + "... (truncated)" | ||
| _job_log(job_id, decoded) |
There was a problem hiding this comment.
This loop will block indefinitely if nothing is written to stderr. Also, an empty line will break out of the loop, which isn't the intended behavior. Check explicitly for None:
if line is None: break
| stdout=asyncio.subprocess.PIPE, | ||
| stderr=asyncio.subprocess.PIPE, | ||
| env=env, | ||
| limit=10 * 1024 * 1024, # 10 MB line limit (default 64KB is too small for olive output) |
There was a problem hiding this comment.
You could start a worker thread to read proc.stdout and not be limited by the size. That approach would also give the user live progress updates rather than waiting until the process completes.
| elif command == CMD_QUANTIZE: | ||
| algorithm = kwargs.get("algorithm", "rtn") | ||
| impl = kwargs.get("implementation", "olive") | ||
| if impl == "bnb": | ||
| extras.add("bnb") | ||
| elif impl == "inc": | ||
| extras.add("inc") | ||
| elif impl == "autogptq" or algorithm == "gptq": | ||
| extra_packages.extend(["auto-gptq", "optimum", "datasets"]) | ||
| elif impl == "awq" or algorithm == "awq": | ||
| extra_packages.append("autoawq") | ||
| # Static quantization needs calibration data | ||
| if algorithm != "rtn": | ||
| extra_packages.append("datasets") |
There was a problem hiding this comment.
This information is available in olive_config.json. I'd rather not duplicate it here.
|
|
||
|
|
||
| @mcp.tool() | ||
| async def detect_hardware() -> dict: |
There was a problem hiding this comment.
This is all supported by the Python package `psutil`. Can we just take a dependency on that module rather than duplicating effort?
| job_log_fn(job_id, f"Reusing cached venv ({key})") | ||
|
|
||
| _touch_venv(venv_path) | ||
| return python_path |
There was a problem hiding this comment.
Might want to add a simple `python -m pip list` to show the status of the created environment.
Describe your changes
Add olive mcp server
Checklist before requesting a review
- lintrunner -a
(Optional) Issue link